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Abstract 

We consider numbers and sizes of independent sets in graphs with 
minimum degree at least d, when the number n of vertices is large. 
In particular we investigate which of these graphs yield the maximum 
numbers of independent sets of different sizes, and which yield the 
largest random independent sets. We establish a strengthened form of 
a conjecture of Galvin concerning the first of these topics. 

Given a graph G, let I{G) be the set of independent sets and let i{G) = 
\I{G)\; and for A; > let Ik{G) be the set of independent sets of order k and 
let ik{G) = \Ik{G)\. Thus i{G) = Ekyo^kiG). 

There are many extremal results on i{G) and ik{G), where G ranges 
over a certain family of graphs, for example, trees or regular graphs (see 
[l]-[3], [5]-[7],[10]). Here we investigate graphs with a given lower bound on 
their vertex degrees. For d > 0, let Gn{d) be the set of graphs of order 
n with minimum degree at least d. (Always n,k and d will be integers.) 
We are interested in which of these graphs yield the maximum numbers 
of independent sets of different sizes, and which yield the largest random 
independent sets. Let us discuss numbers first. 

Recall that the independence number a{G) is the maximum size of an 
independent set. Clearly a{G) < n — d for each G € Gnid)- Recently, 
Galvin [3] proved that, for n suitably larger than d, i{G) < i{Kd,n-d) for 
any G G Qn[d) that is not (isomorphic to) Kii n-d- Moreover, he conjectured 
essentially that for any d> 1, there exist integers N{d) and G{d) such that 
for each n > N{d), K^n-d maximizes over all graphs in Gnid) for each k 
satisfying C{d) < k < n — d; and he proved such a result in the case when 
d = l. 

We shall see that this conjecture holds even if d is allowed to grow slowly, 
and further we can take C{d) = 3. Observe that we need C{d) > 3. For, 
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each n- vertex graph has io{G) = 1 and ii{G) = n. Also i2{G) = (2) — e(G), 
where e{G) is the number of edges, and graphs G S Qn{d) can have ?2(G) > 
i2iKd,n-d)- (For example, if d is fixed and n is large and even, K(i,n-d has 
d{n — d) ^ dn edges, whereas a (i-regular graph has dn/2 edges.) We shall 
show: 

Theorem 1. Let 1 < d = d{n) = o(n^/^). Then for all sufficiently large n, 
for each graph G € Qn{d) and each k > 3 we have ik{G) < ik{Kci.n-d); o.nd 
if G is not Kd,n-d then i2{G) + ii{G) < i2{Kd,n-d) + H{Kd,n-d), and so 
i{G) < i{Kd,n-d)- 

A graph G € Qn{d) with a{G) = n — d has the form G = H + In-d for 
a graph H of order d and the empty graph In-d on n — d vertices. (Recall 
that for graphs G, G' with disjoint vertex sets, the sum G + G' denotes the 
graph obtained by adding all edges between them.) Let A"*^ denote the 
graph Ka + h- 

Denote by X{G) the size of an independent set chosen uniformly at 
random fromX(G). Recall that X is stochastically dominated by y, denoted 
by X <s Y, if P(X <t)> ¥{Y < t) for each t. 

If G G Gn{d) satisfies a{G) = n — d and G is not K^^-d^ then G is 
(isomorphic to) a proper subgraph of i^^^.^;, and so i{G) > i{Kdn-d)'^ ^^"^ 
it follows that P(X(G') < t) < P(X(if^„„^) < t) for t = and t = 1. 
Hence it is not the case that X{G) <s X{K2^_^). Nevertheless, our second 
theorem shows that, if we ignore independent sets of size at most 1, then 
of all graphs in Qnid), the graph -ft'^„_^ is the unique graph yielding the 
largest random independent sets. 

Theorem 2. Let 1 < d = d{n) = o{n^/^). Then for all sufficiently large n, 
for each graph G G Qn{d) other than Kdn-d' '^^ have 

¥{X{G) >t)< ¥{X{Kl^_^) > t) for each t = 3,...,n-d, 

and if a{G) < n — d then this inequality holds also for t = 1 and 2. 

This yields directly: 

Corollary 3. Ifd is as above, then for all sufficiently large n, for each graph 

G e Gnid) 

X(G)<,max{2,X(K^,„_,)}, (1) 

and 

ifa{G) <n-d then X{G) <, X{Kl^_^). (2) 
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Also, since E(X) = ^f^i^iX >t), we may obtain almost directly: 

Corollary 4. If 1 < d = d{n) = o{'n}/^), then for all sufficiently large n, 
for each graph G G Gn{d) other than -ft'^„„^, we have 

E(X(G)) < nX{Kl^^d)) < (n - d)/2. 

In order to prove these results, it turns out that the 'growth rates' 
of the numbers of independent sets are crucial quantities. For a graph G 
and positive integer k < a(G), let ak(G) := . ^'^l,-. . Thus afc(G) is 1/k 
times the average number of extensions of an independent {k — l)-set to an 
independent A;-set in G; or (roughly) the 'average number of extensions per 
vertex' at size k. 

To prove Theorem 1 we use two lemmas, one on growth rates ak{G) 
and one on the 'base case' ii{G). To prove Theorem 2 we need one further 
lemma, a general result on growth rates and stochastic domination. 

We adopt the following notations. For a graph G and integer d let 
A = A{G, d) = {ve V{G) : deg{v) > d} and B = V{G)\A; and let a = \A\, 
b = \B\. Also recall the standard notation that, if [/ is a set of vertices in G, 
then the neighbourhood T{U) is the set of neighbours of vertices in U, and 
the closed neighbourhood T[U] is T{U) U U. 

Lemma 5. (a) For each 1 < d < n and G € Quid), we have ak{G) < 
'^k{Kd,n-d) f"^ 3 <k < a{G). 

(b) Let 1 < d = d{n) = o{n^/^). Then for all sufficiently large n, for each 
G,K G Gn{d) with a{G) < n — d = a{K), we have ak{G) < ak{K) for each 
4<k< a{G). 

Proof. Let 3 < A; < a{G). Since each vertex degree in G is at least d, each 
/ G Ik-i{G) can be extended to at most n — d — k + 1 independent fe-sets. 
Call / good if this upper bound is attained, and otherwise call / bad. Note 
that / is good if and only if |r(/)| = d, if and only if each vertex in / has 
the same set of d neighbours. Also, each / is good if G is K'^^_^. 

Since each independent /c-set contains exactly k independent (A; — l)-sets, 
we have ik-i{G){n - d - k + I) > kik{G). Hence, ak{G) < But 
ak{Kln-d) = ""''fc^"^^ for A; = 3, . . . , n - d. This establishes part (a). 

Now we prove part (b). Let 4 < /c < a{G). Suppose first that k>d + 2. 
Let J be an independent set in G of size a{G) < n — d — 1. Let W be 
a set of d + 1 vertices outside J, and note that each vertex in W has at 
least one neighbour in J. Since k — 1 > d + 1 we may pick a {k — 1)- 
subset / of J with r(/) 2 W, and so / is bad. Now, since there is a bad 
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independent {k - l)-set, afc(G) < B^^i^t+l. Further, ak{K) = for 
each k = d + 2, . . . ,n — d, so this case is done; and so to prove part (b) we 
may assume that 4 < k < d + 1. 

Assume also that n > 2d (as we may). Write K = H + In-d foi^ some 

graph H of order d. Then ak{K) = (i-d-^\^.''°[^ly Since ik-i{H) < (^li), 

for each k < d + 1, 

/n—d\ 

> ^ 1^,^^ > . ' ^ ( I • (3) 

Let p and (7 denote the numbers of good and bad sets in respec- 
tively, so p + g' = ik-i{G). Then 

kik{G) < p{n — d — k + 1) + q{n — d — k) = (p + q){n — d — k + 1) — q, 

so 

"^^^^^ k WTTY 

Assume for a contradiction that ak{G) > akiK). Then it follows using (3) 
and (4) that 

^q^^^-'-'^'ikU)/{l-t)< in-d'-k\i)^-^ - 

Observe that, since k > A, the final bound above is 0{d^n~'^) = o(?i~^). 
Thus certainly p > 0. 

Claim: For each good independent (A; — l)-set / in G there is a vertex 
w^IU r(/) such that T{w) / r(/). 

We will prove the claim later: suppose for now that it holds. Then from 
each good independent (k — l)-set / we may construct a bad independent 
(k — l)-set /' by deleting a vertex u from / and adding a vertex w as in 
the claim. This gives at least p{k — 1) > 3p constructions. Also, in each 
bad independent {k — l)-set /' which has been constructed, we can identify 
the vertex w added (since the other k — 2 > 2 vertices all have the same 
neighbourhood). Thus each bad independent (k — l)-set /' is constructed at 
most n — k + l<n — 3 times. Hence 

q > 3p/{n - 3) > p/{n - 1) 

and so q/{p + q) > 1/n, which contradicts (5) (for n sufHciently large, since 
k > 4). 
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It remains to prove the claim. Recall that B = {v ^{G) : deg(f ) = d}. 
Let I be a good independent {k — l)-set. Note that I ^ B and |r(/)| = d. 
If 1^1 = a > d + 1 then for w we may pick any vertex in yl \ r(/). So we 
may assume that a < d. 

Let Bi = {v £ B : r{v)nB / 0} and B2 = B\Bi. Since a{G) < n-d < 
\B\ we have E{B) / and so Bi / 0. Either I ^ Bi 01 I Q B2, since each 
vertex in / has the same set of d neighbours. If / C then / C T{v) for 
some V € Bi, and so for w we may pick any vertex not in r(/) U T{v) (at 
least n — 2d > 1 choices). If / C i?2 then for w we may pick any vertex in 
Bi. This completes the proof of the claim, and we are done. □ 

The previous lemma concerns ratios; the next considers the base case. Of 
graphs in Qn{d), clearly a d-regular graph has the most independent 2-sets: 
we look at the number of independent 3-sets. We first give a formula for 
13(0) for any graph G. Let ti be the number of induced subgraphs of G on 
three vertices with i edges. Then 

to + ti+t2+t3, 
tl +2t2 + 3t3, 
t2+3t3. 

Hence, 




where t{G) = is the number of triangles. For example, if G is a d-regular 
graph then 

Z3(G) = Q -l<in(n-2)+nQ 

'] --dn+ -d{d'^ + U + 2)- t(G). 
/ 2 6 

Lemma 6. Let 1 < d = d{n) = o{n^/^). For all sufficiently large n, if 
G,K € Qn{d) are such that a{G) < n — d = a{K), then «3(G) < i3{K) — 
n/2 + 1. 



n 



e(G)(n-: 
^ / deg{vi) 
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Proof. Our proof relies on (6). Consider G € Qn{d) with a{G) < n — d. 
We first show that we may assume without loss of generality that the set A 
of vertices of degree > (i is a non-empty independent set, and then that it 
suffices to prove (7) below; then we prove (7) by considering four cases for 
a = \A\. 

Suppose that G is d-regular. Then by the above we have 
UG) <l^~'^^- \dn + U{d^ + 3d + 2). 
But i^{K) > ("3'^). Thus, if d = 1 then 

hiG) <(''~'^)-n/2 + l< h{K) - n/2 + 1; 
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and if d > 2 then 



i3{G) < 3 - n + Oid') < n{K) - n/2 

for n sufficiently large. Hence we may assume that G is not regular, and so 
A is non-empty. 

Now repeatedly delete edges between vertices of degree > d, as long as G 
keeps satisfying a{G) < n — d. We end up with some graph G' € Qn{d) with 
a{G') < n — d. Suppose that there is an edge uv G E'{A') after this step (we 
use E' and A' to refer to G'). Then there exists an (n — d)-set I such that 
E'{I) = {uv}. Let J = V{G') \ I, so \ J\ = d. Since degG/(n), degG',(?;) > d 
and degQ, (w) > d for each other vertex w G I, every possible edge between 
/ and J is present in G' . Therefore, since there are {n — d — 2) 3-subsets of 
/ containing u and v, 



,^r^ fn — d\ , , , fd\ fn — d\ n 



for large n, since d = o(n^/^). Hence, we may assume that A is independent. 
For each Vi G A, let = deg{vi). Observe that 2e{G) = rj +{n — a)d. 
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Thus, from (6), 
2h{G)-2\ 

a 

= "E^i +{n-a)d\{n-2) + ^ri{ri-l) + {n-a)d{d-l)-2t{G) 



i=l 1=1 

a 



= E - n + 1) - (n - a)d{n -d-l)- 2t{G) 

i=l 

= -dn{n-d-l) + hd{G), 
where 

a 

hd{G) = ^ r,(r, - n + 1) + ad{n - I - d) - 2t{G). 
1=1 

Thus 

''^^^ ^ Q - ^fin(n - d - 1) + 

''~'^)-ldn + ld{d^ + 3d + 2) + l-hd{G). 
6 J 2 b 2 

Observe that here only ^hd{G) varies with G G Gn{d). Since iz{K) > ("3^^), 
by the last equality 

hd{K) >dn- -d(d2 + 3d + 2) = (1 + o(l)) dn. 
3 

Thus it suffices to show that 

hd{G) < {d - l)n + 0{d^) (7) 

and the remainder of the proof is devoted to establishing this result. 

Recall that we are assuming that in G the set A of vertices of degree > d 
is independent. Thus d + 1 < < n — a for each i = 1, . . . ,a. Consider the 
function g(x) = x{x — n + 1) = —x{n — 1 — x) for real x. This is decreasing 
for X < (n — l)/2 and increasing for x > (n — l)/2. We now break the proof 
of (7) into four cases: a>d + 2, a = d+l, a = d, and 1 < a < d — 1. 

Suppose that a > d + 2. Then each d+1 <rj < n — d — 2, so g{ri) < 
(d + l)(d + 2 - n). Hence, 

hdiG) < a(d+l)(d + 2-n) + ad(n-l-d) 

= a(-n + 2d + 2) (8) 
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and so (7) holds. 

Suppose that a = d + 1. Then d + 1 < ri < n — d — 1 for each i, and 
Yli=i ^« ^ d{n — a) = d{n — d — 1). Thus at most d — 1 of the are equal 
to n — d — 1, and so 

a 

^g{ri) < -id-l)d{n-d-l)-2{d+l){n-d-2) = -n{d^ + d+2) + 0{d^). 

i=l 

Hence hd{G) < -2n + 0{d^), and so (7) holds. 

Suppose that a = d. Since a{G) < n — d, e{B) > 0. It follows that 
Yli=i — ^(^ — d) — 2 < d{n — d) — 1. Hence not all d of the ri are equal 
to n — d, and so 

hd{G) < {d-l){n-d){l-d) + {n-d-l){-d)+d^{n-l-d) 
= {d-l)n + 0{d^) 

as required. 

Finally, suppose that 1 < a < d — 1. Consider Vi G A. Suppose that 
ri > n — d —1. Then the edge-boundary of T[vi) has size at most r^a + (n — 
a — rj)(i < rja+d((i+l — a), and so 2e(r(i;j)) > ri{d—a) — d{d+l — a). Hence, 
twice the number of triangles containing Vi is at least ri{d — a) — d{d+l — a). 
Also, using first that ri < n — a and then that ri > n — d — 1 we have 

g{ri) - ri{d - a) = ri{ri - n+l - d + a) < ri{l - d) < {n - d - 1)(1 - d). 

On the other hand, if < n — d — 2, then g{ri) < {d + l){d — n + 2). Let 
I = \{i : ri > n — d — Then h(i{G) is at most 

[9{ri)-ri{d-a)+d{d+l-a)]+ ^ g{ri) + ad{n-l-d) 

i:ri>n,— d— 1 i:ri<n—dr-2 

< l[{n-d-l){l-d) + d((i+l-a)] + (a-/)(d+l)(d-n+2)+ad(n-l-(i) 

< a7i + 0{d^) < {d-l)n + 0{d^) 

as required. □ 

With the last two lemmas, we may now prove Theorem 1, establishing 
a stronger version of the conjecture of Galvin [3] mentioned earlier. 

Proof of Theorem 1. If q;(G) = n — d then G is (isomorphic to) a supergraph 
of Kd^n-d and the result is trivial: so we may assume that a{G) < n — d. 
Let us also assume that n is large. Let K € Quid) with a{K) = n — d. 
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Since isiG) < i3{K) — n/2 + 1 by Lemma 6 (b), by Lemma 5 we have 
ik{G) < ik{K) for all A; > 3. In fact, i^iG) < i4{K) - ^(n^) since a4{K) = 
0(n). On the other hand, e(G) > dn/2, so that Z2(G) -i2(i^) < dn/2. Thus 
*2(G) + «4(G) < i2{K) + i4,{K), and we are done. □ 

To prove Theorem 2, as well as the two corollaries, we need one further 
lemma, which is a general result on growth rates and stochastic domination, 
adapted from Lemma 2.4 of [9]. Given a finite sequence of positive real 
numbers x = {xo,xi, . . . , Xg), let S{x) = X]a:>o ^fc- Define a random variable 

X = X{x)hyFlx = k) = Xk/S{x). 

Lemma 7. Let xq, yo > 0, let 1 < a < b be integers, and let ai, . . . ,aa > 
and f3i,...,f3b > 0. For i = l,...,a, let Xi = xoYIq^j^^uj; and for 
i = 1,...,6, let yi = yo no<i<j /^i- ^ = {xo,xi, . . . ,Xa) and y = 

iyo,yi, ■ ■ ■ ,yb), and denote X{x) by X and X{y) by Y. If ai < Pi for 
each i = l,...,a, then X <s Y. Further, if these conditions hold, and 
(Qi,...,Qa) / (/3i, . . . ,/3fe), then 

^{X >t) < ¥{Y > t) for each t = l,...,b. 

Proof. By replacing y^ by Ylj>ayj^ assume that b = a. It suffices 

to consider the case when Oi = j3i for all i except jo where aj^ < /3jg. Since 
F{X < a) = P(y < a) = 1, it suffices to prove F{X < t) > F{Y < t) for 
t = 0, ... ,a — 1. Note that we may rescale Xj,yj's without changing the 
distribution. 

Suppose t satisfies < t < Jq — 1. Rescale to = yo = 1. Then Xi = yi 
for alH < t and S{x) < S{y). So P(X < t) = > = ¥{Y < t). 

For t such that jo ^ ^ ^ — 1, we rescale to Xj^ = yj^. Then Xi = yi for 
all i = jo, jo + 1, . . . , a and 5(x) > S{y). Hence, F{X > t) < F{Y > t) and 
so F{X <t)> F{Y < t). □ 

Proof of Theorem 2. There are two cases, depending on whether a{G) < 
n — d ov a{G) = n — d. 

(a) Let G £ Qn{d) with a{G) < n - d. For k > I, let denote 
ak{K^^_^). Then ai{G) = = n. By Lemma 5 (a), ak{G) < a^. for 
3 < A: < a{G). 

If a2{G) < a\ then directly from Lemma 7 we have F[X{G) > t) < 
F{X{K^^_^) > t) for each t = 1, . . . , n — d, and we are done. So we may 
suppose that a2{G) > a"^] that is i2(G) > where denotes ik{K2n-d)- 



9 



Let X be the ifc-vector for G (up to Xn-d), let z be the ifc-vector for 
^dn-d-i ^^"^ 1"^^ ^ agree with x in the first three places, and agree with z in 
the remaining places; that is, 

X = (xo, XI, ... , Xn-d) = (1, n, i2{G),i'i{G),ii{G), in-d{G)), 

y = {yo,yi, - ■ ■ ,yn-d) = il,n,i2{G),i*2,il, . . . ,in-d) 

and 

Z = {zo,Zi, . . .,Zn-d) = (l,n,i2,«3,il, . . ■,in-d)- 

Let 3 <t <n-d. By Lemma 5 (b) with K = K^^-d^ ^^"^ ^^'^^ 4 < A; < a{G) 
we have < Moreover, by Lemma 6, i-^{G) < so that Si < M_ 

Then by Lemma 7, 

F{X{G) >t)= F{X{x) >t) < r{X{y) > t). 

Also 

nX{y) >t)< nX{z) >t) = F{X{Kl^^,) > t) 

since Siy) < S(z). Hence P(X(G) > t) < P(X(ii"*„_^) > t) as required. 

To complete the proof for this case, note that by Theorem 1, i{G) < 
ii^ln-d)^ SO that 

P(X(G) < 0) = l/i{G) > lA(irrfVrf) = nXiKl^^d) < 0), 
and similarly 

F{X{G) <!) = (! + n)A(G) > (1 + n)/.(ii',%_,) = P(X(jq,„_,) < 1). 

(b) It remains to consider the case when a{G) = n — d and G is not 
^dn-d- Then G may be obtained from K'^^_^ by deleting at least one edge 
from the part. Thus i{G) > and the ifc-vector x of G may 

be obtained from the i^-vector z for K^^_^ by adding positive integers to 
some entries amongst the first d + 1 including adding at least 1 to Z2 . It is 
immediate that P(X(x) > t) < F{X{z) > t) for each t = d + 1, . . . ,n - d. 
Let 2 < t < d - 1. Then 

P(X(z) <t) = 

To obtain P(X(x) < t) from the last ratio we add at least 1 to the numerator 
and at most 2"^ to the denominator. Thus the numerator increases by a factor 
(l+r2(n~'^)) and the denominator increases by a factor at most (1+2"^""^"^^). 
So overall the ratio increases (for large n), that is ¥{X{z) < t) < P(X(x) < 
t), as required. □ 
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We noted earlier that Corollary 3 follows directly from Theorem 2, so it 
remains only to prove Corollary 4. 

Proof of Corollary 4- If ce(G) < n — d, the result follows directly from (2). 
Suppose then that a{G) = n — d, and let n be sufficiently large that 
E[X{K^^_j^] > d. Then the average size of the sets which are inde- 
pendent in G but not in K^^_^ is at most d < K[X{K2^_^)], and so 
E[X(G)]<E[X(i^*„_d)]. ' ' □ 

We remark that with an analogous method, a weighted version of the 
statements can be proved. Let I{G, A) = X]/t>o ^fc(^)'^'^ independent 
set polynomial of G ([4], [8]). Instead of a uniform sampling of independent 
sets of 1(G), we fix A > and pick a given independent k-set with probability 
X^/I{G, A). Then under this sampling, the analogous versions of Theorem 2 
and its corollaries hold. 
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